TinyViT: Fast Pretraining Distillation for Small Vision Transformers
نویسندگان
چکیده
Vision transformer (ViT) recently has drawn great attention in computer vision due to its remarkable model capability. However, most prevailing ViT models suffer from huge number of parameters, restricting their applicability on devices with limited resources. To alleviate this issue, we propose TinyViT, a new family tiny and efficient small transformers pretrained large-scale datasets our proposed fast distillation framework. The central idea is transfer knowledge large ones, while enabling get the dividends massive pretraining data. More specifically, apply during for transfer. logits teacher are sparsified stored disk advance save memory cost computation overheads. student automatically scaled down parameter constraints. Comprehensive experiments demonstrate efficacy TinyViT. It achieves top-1 accuracy 84.8% ImageNet-1k only 21M being comparable Swin-B ImageNet-21k using 4.2 times fewer parameters. Moreover, increasing image resolutions, TinyViT can reach 86.5% accuracy, slightly better than Swin-L 11% Last but not least, good ability various downstream tasks. Code available at https://github.com/microsoft/Cream/tree/main/TinyViT .
منابع مشابه
A Fast Method for Calculation of Transformers Leakage Reactance Using Energy Technique
Energy technique procedure for computing the leakage reactance in transformers is presented. This method is very efficient compared with the use of flux element and image technique and is also remarkably accurate. Examples of calculated leakage inductances and the short circuit impedance are given for illustration. For validation, the results are compared with the results obtained using practic...
متن کاملAbstraction for Shape Analysis with Fast and Precise Transformers
ion for Shape Analysis with Fast and Precise
متن کاملDeriving fast distillation models: diploma thesis proposal
Distillation is the most important separation technology today. A trend in controlling distillation columns economically efficient is going toward using model predictive control (MPC) algorithms, which calculate an optimal input trajectory to the plant based on repeated simulations of a model of the process to predict the future behaviour of the plant with respect to disturbances and control in...
متن کاملFast and Accurate Robot Vision for Vision Based Motion
This paper describes the vision module from the soccer playing robots of the Dutch Team. Fast vision is necessary to get a close coupling with the motion software in order to allow fast turning and dribbling with the ball without loosing it. Accurate vision is necessary for the determination of the robot's position in the field and the accurate estimation of the ball position. Both fast and acc...
متن کاملFast Onboard Stereo Vision for UAVs
In the last decade researchers have built incredible new capabilities for small aircraft, with quadrotors moving from labs to toy stores and with autonomy reaching smaller and smaller vehicles. As the systems, and their payload capacities shrink, we can no longer use typical aircraft sensors such as RADAR, scanning LIDAR, and other active sensing methods for obstacle detection and avoidance. Sm...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Lecture Notes in Computer Science
سال: 2022
ISSN: ['1611-3349', '0302-9743']
DOI: https://doi.org/10.1007/978-3-031-19803-8_5